Large Scale Metric Learning for Matching of Heterogeneous Multimedia Data
نویسندگان
چکیده
Heterogeneous multimedia data are widely encountered in many applications, such as photo-sketch face recognition, still image to video face recognition, cross-modality image synthesis, cross media retrieval, etc. With the ubiquitous use of digital imaging devices, mobile terminals and social networks, there are lots of heterogeneous and homogeneous data from multiple sources, e.g., news media websites, microblog, mobile phone, social networking, etc. Matching of heterogeneous multimedia data becomes increasingly important to achieve cross modal and cross media information retrieval. One popular approach to the matching of heterogeneous data is metric learning, which learns a positive semi-definite matrix to measure the similarity of heterogeneous data. However, there are several challenging issues that the current metric learning methods cannot adequately address. First, the current metric learning methods are limited in dealing with the highly diverse and complex data types in real-current methods have poor scalability, which is a critical issue in handling the tremendous amount of multimedia data. Third, the labels of most data are unavailable, making them difficult to be used by current metric learning methods. Fourth, the data in the same modality may have different representations, and thus a multiple feature
منابع مشابه
Composite Kernel Optimization in Semi-Supervised Metric
Machine-learning solutions to classification, clustering and matching problems critically depend on the adopted metric, which in the past was selected heuristically. In the last decade, it has been demonstrated that an appropriate metric can be learnt from data, resulting in superior performance as compared with traditional metrics. This has recently stimulated a considerable interest in the to...
متن کاملHeterogeneous Metric Learning for Cross-Modal Multimedia Retrieval
Due to the massive explosion of multimedia content on the web, users demand a new type of information retrieval, called cross-modal multimedia retrieval where users submit queries of one media type and get results of various other media types. Performing effective retrieval of heterogeneous multimedia content brings new challenges. One essential aspect of these challenges is to learn a heteroge...
متن کاملEye-Tracking Method’ Usage for Understanding the Cognitive Processes in Multimedia Learning
Introduction: Designing multimedia learning environments should consist of the evidence-based study and principals about the human learning process. Eye tracking is a way based on the learner processing of learning materials which presented in multimedia learning environments. The aim of the study was to examine the use of the eye-tracking method to investigate the cognitive processes in m...
متن کاملHeterogeneous Metric Learning with Joint Graph Regularization for Cross-Media Retrieval
As the major component of big data, unstructured heterogeneous multimedia content such as text, image, audio, video and 3D increasing rapidly on the Internet. User demand a new type of cross-media retrieval where user can search results across various media by submitting query of any media. Since the query and the retrieved results can be of different media, how to learn a heterogeneous metric ...
متن کاملLocally adaptive subspace and similarity metric learning for visual data clustering and retrieval
Subspace and similarity metric learning are important issues for image and video analysis in the scenarios of both computer vision and multimedia fields. Many real-world applications, such as image clustering/labeling and video indexing/retrieval, involve feature space dimensionality reduction as well as feature matching metric learning. However, the loss of information from dimensionality redu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014